ON THE INTERPRETATION OF FREQUENCY CURVES 

IN BIOLOGY 

by 

E. F. DRION. 

(from the Botanical Institute Government University Leyden) 


TABLE OF CONTENTS. 

page 

CHAPTER I. THEORETICAL .INTRODUCTION. 78 

§ x. Introduction . 78 

§ 2—8. The necessary conditions that a frequency 

curve may have a quantitative value .... 80 

§ 9. Possible conclusions from given frequency 

curves .. 89 

§ 10—12. The hypothesis of Quetelet . 91 

§ 13. The methods of Pearson . 96 

§ 14. The method of Edgeworth and Kapteyn .. 97 

§ 15. The methods of ’Student’ and Fisher .... 100 

§ 16. The methods of Kapteyn and Van Uven 

used to compare frequency curves . 101 

CHAPTER II. STATEMENT OF THE PROBLEM, 
MATERIAL AND METHODS. 

§ 1—2. Statement of the problem . 102 

§ 3. The experiments with Rohinia Pseudacacia 103 

§ 4. The experiments with Kalanchoe verticillata 104 

CHAPTER III. A BRIEF SURVEY OF THE RESULTS. 108 
§ 1. The results of the measurements of the foliola 

of Rohinia Pseudacacia . 108 

§ 2. The results of the experiments with Kalan¬ 
choe verticillata . 112 

CHAPTER IV. SUMMARY AND CONCLUSIONS. 121 

APPEND. DERIVATION OF FORMULAE . 123 

LITERATURE . 128 















78 


I. THEORETICAL INTRODUCTION. 

§ x. If one measures the same characteristic of a great number 
of individuals of one species (or of a great number of equal organs 
of one individual), one generally will find different values. If all 
values occurred equally often life would be impossiblerats as big 
as elephants and men as small as fleas would exist. And if the 
size of all the characteristics was distributed at random all indivi¬ 
duals would have a different form. Happily there seems always, 
to exist a certain system in the results of our measurements. Indivi¬ 
duals showing a certain average of the measured characteristic are 
most frequent, while individuals with a deviating value occur the 
rarer the greater the deviation from this average is. In order to 
compare two different characteristics of one species, or the same 
characteristic of two different species, it is necessary to measure a 
great number of individuals, for if one has measured only one 
individual for each characteristic (or for each species) it would be 
impossible to deduce a general law from the results. Perhaps one 
would find just the reverse in repeating the experiment. To get a 
good survey of the results it is desirable to plot them in a fre¬ 
quency curve, this means that one plots the results of the measure¬ 
ments against the number of individuals in which that result is 
found. 

To make the argumentation less abstract we will limit ourselves 
to the discussion of the frequency curves of the length of plants, 
and we will suppose that this length is only the result of a growth- 
process. This means that we suppose that the differences in the 
initial-length are small compared with the differences in the final 
length. In macroscopic methods of measuring this will generally 
hold, in microscopic often not. The length of a cell for instance 
will not only depend on its growth but also on its length directly 
after the division of its mother cell. Almost all frequency curves 
of gradually changing objects, however, originate in a manner 
comparable to a growth-process. 

As we have said already, a measurement performed on a single 
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individual has 'but little value. When repeating the measurement 
on another individual, or even if possible on the same individual, 
probably another result would have been found, on account of 
the variability of biological material. Therefore one. tries to elimi¬ 
nate this variability by measuring a great number of individuals. 
But the result of the measurements of a great number of individuals 
may always be plotted as a frequency curve. Therefore the mea¬ 
suring of a great number of individuals means that one supposes 
that, under certain circumstances, a frequency curve is less variable 
than a single individual, and that one expects to find a nearly 
identical frequency curve when repeating the experiment. For 
only in that case a frequency curve will have any value. Therefore 
the questions that should be answered before any quantitative 
biological work is possible are: 

1. Which circumstances will enable us to draw any conclusions 
from frequency curves? 

2 . What may be concluded from given frequency curves? 

To answer these questions we will follow this way: As it is 

supposed that the length of a plant is only the result of a growth- 
process, the factors which determine the speed of growth at a 
certain moment must be determined. Then it will be possible to ” 
answer the question how it is that, when measuring a great number 
of individuals, one finds on the one hand many different lengths, 
on the other hand many individuals of the same length, in other 
words how a frequency curve is generated, and also which causes 
determine its form. It will turn out that under certain conditions 
the form of the frequency curve does not depend upon the exact 
magnitude of the factors which determine the growth velocity. 

An accurate study of these conditions will give the answer to 
the first question and even a qualitative answer to the second 
question, since it will turn out that under these conditions a 
repetition of the experiment shall give a nearly identical frequency 
curve. 

Question 2 generally may be put in this form: 

Two frequency curves are given. It may be that the conditions 
mentioned are fulfilled. Are the differences between the two curves 
small enough to make it probable, or - large enough to make it 
improbable that this hypothesis is true? A complete answer to this 
problem may only be given after a further discussion of the form 
of the frequency curve. It will turn out that it is impossible to 
determine this form ’a priori’, and that all previous theories which 
attempt to do this were incomplete or premature. Methods for the 
comparison of frequency curves that are based on those theories* 
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are not acceptable. The drawbacks of the method of curve-fitting 
will then be discussed. 

Finally the method of Kapteyn and van U v e n will be 
discussed as a method to compare frequency curves. It will be 
shown that with its help it is often possible to give a positive 
answer to question 2 . 

§ 2 . The growth rate of a plant depends on two kinds of 
factors: the internal and the external. 

The most important internal factors are the hereditary qualities 
of the plant. It is clear that generally two plants with different 
hereditary qualities shall have different growth velocities, for 
differences in hereditary qualities appear only as differences in 
reaction to internal or external stimuli. 

Age is also a very important internal factor. This is very clearly 
shown by some perannial plants, which have a long period of 
growth. The growth velocity of these plants, when grown under 
constant external conditions, is not constant. At first they grow 
slowly, then their growth rate increases, but finally it decreases 
again. As the external conditions did not change this proves that 
the internal conditions of the plants were changed during their 
development. 

But if two plants have the same hereditary qualities and the 
S' me age even then they generally will not have the same internal 
conditions. For even if their internal conditions were equal origi¬ 
nally, differences in the external conditions will make them unequal 
after a certain time. 4 

We shall call the internal conditions as far as they are only 
the result of age and of the external conditions the constitution . 
Differences in constitution (in this sense) are therefore never a 
result of differences in hereditary qualities. It is impossible to 
compare the constitutions of plants with different hereditary 
qualities. 

The differences in constitution of two plants as a result of 
differences in the external conditions may be permanent. This will 
happen if the plants were influenced by different external condi¬ 
tions during a susceptible period. The susceptible period is a time 
during which e.g. an organ is formed. If the external conditions 
influence this formation, two plants which were influenced by 
different external conditions during this time will remain different 
during their entire life. 

The difference in constitution between two plants, as the result 
of different external conditions during a non-susceptible period 
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generally may be annulled, and even reversed by opposite diffe¬ 
rences in the external conditions. But if the external conditions 
were very bad for one of the plants, it may be that its constitution 
has deteriorated permanently, an extreme case of which is death. 

§ 3 . The external factors: water, light etc. also have a great 
influence on the speed of growth. Generally every change in the 
magnitude of these factors will cause a change in the growth 
velocity. The magnitude of this change in the growth rate resul¬ 
ting from a certain definite change in the magnitude of one of 
the factors is a function of the magnitude of all factors. The 
value of this function becomes very small if the magnitude of 
any of the factors, except the varying factor, drops below a 
certain limit, or if the varying factor surpasses a certain limit. 
In other words, if all the factors have a certain magnitude, the 
speed of growth will only change appreciably if the magnitude 
of one well-defined factor changes. This law is called the law 
of the limiting factors, or Blackman’s law. In our case the factor 
whose change results in a appreciable change of the speed of 
growth is called the limiting factor. If two plants of the same 
age, the same heredity and the same constitution, are placed under 
the same external conditions, then their speed of growth will be' 
equal. If for one of the plants one of the external conditions is 
altered this only will noticeably influence its speed of growth if 
that condition happened to be a limiting factor. 

§ 4 . We have supposed that the length of the plant is only 
determined by a process of growth. Therefore it will depend 
primarily on the time that process lasted, and secondarily on the 
growth rate at every moment of its existence. In general plants 
of different age will have different length. There are cases where 
the difference in length is primarily caused by a difference in age 
as in the case of the length of Spirogyra - and Chaetomorpha-ct 11s. 
(Investigations of Baas Becking and Baker). After the cells 
of those algae have reached a certain length they divide, and start 
growth again. So the length of their cells is chiefly determined 
by the time lapsed since the last division. 

On the other hand it happens that organisms (or organs) do not 
grow any more after a certain age: they are ’’full-grown”. The 
length of these organisms will not depend on their total age, but 
only on their age when mature. The moment of maturity may 
depend on internal as well as on external conditions, as e.g. the 
flowering of the terminal-bud, which process may be determined 
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by the external conditions if the bud has reached a certain 
development. 

The length of plants of the same age depends on their speed 
of growth on all preceding moments, which, on its turn, depends 
on the internal and external conditions during those moments. 
Therefore, the length at a certain moment will also depend on 
those conditions. Most important of the internal conditions are 
the hereditary factors. This has been proved by Johannsen 
in his experiments with beans. Johannsen planted a great 
number of beans belonging to one sample. The mean length of 
the beans was different from plant to plant. If he planted a num¬ 
ber of 'beans originating from the same plant, then the mean 
length of the beans from those plants was the same for all plants. 
And if he now planted beans originating from different plants 
(but having the same P 2 ) the mean length of the beans which 
came from those plants was also the same for all plants. As 
beans are always self-fertilized, all the descendents of one bean 
will have the same hereditary qualities. This experiment proves 
that the mean length of plants of the same heredity is equal, when 
grown under practically equal external conditions. Therefore we 
may conclude that the hereditary qualities of the beans of the 
original sample were unequal, for the mean length of the beans 
of the Fi generation was unequal. Consequently the hereditary 
qualities of plants of the same species are not necessarily the same. 
In order to be certain that the hereditary qualities are equal, one 
must work with plants that are the vegetative offspring of one 
mother plant. And even then it is always possible to obtain a 
bud-variation! A reasonable certainty as to the genetic homogeneity 
exists in plants which are self-fertile, and if one uses the offspring 
of one plant. 

In the second place the length of plants of the same age will 
depend on their constitution on every previous moment. To recapi¬ 
tulate: Constitution is the internal condition as far as it is 
determined by the age and the external conditions on all pre¬ 
vious moments. Because the sprouts or seeds have passed through 
a long period of development on the mother plant it is possible 
that the plants have a different constitution on the moment of 
planting. In our experiments with Kalanchoe verticillata S. Elliot 
we have measured, three weeks after planting, the width of the 
leaves which were formed on the mother plant. After eight 
months the length of the largest plant where this leaf was 
narrower than 5.45 m.m. was not larger than the length of 
the smallest plant where this leaf was broader than 7.60 m.m. 
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In each group there were more than 60 plants. This proves that 
plants with a different width of the first leaf have a different 
constitution, and this constitution may differ so much that after 
eight months it is still apparent from the length of the plants. 

If during a susceptible period, the plants grow under dif¬ 
ferent external conditions this will of course highly influence 
the ultimate length of the plants, for in that case the consti¬ 
tution will always be different. 

The external conditions . 

§ 5 . Except in experiments in the laboratory the external 
conditions are rarely the same during the whole development 
of the plant. But even on the same moment they generally will 
not be the same for different plants. Therefore, even if the 
constitution and the heredity of two plants are equal, their 
speed of growth will as a rule differ, because of the differences 
in the external conditions. On the other hand it is possible 
that two plants with different constitutions, or even different 
hereditary qualities, have the same speed of growth as a con¬ 
sequence of differences in the external conditions. But as the 
external conditions are usually not quite constant this situation 
will not be permanent as a rule. 

It is perhaps impossible to grow plants under absolutely equal 
external conditions. But it is certainly not impossible to make 
the conditions so similar that none of the plants is always ex¬ 
posed to more favourable conditions than the other, and that 
for no period one is able to indicate a condition that is better 
for one plant than for any other plant. Such conditions we will 
call „practically equal”. 

But even if the plants are exposed to practically equal con¬ 
ditions, yet there remain differences that one time will favour 
one of the plants, another time an other. Besides it is very 
well possible that two plants may not equally utilize the same 
favourable circumstance. E.g. the sun is shining some moments. 
By chance, one of the plants has its leaves perpendicular to the 
direction of the sun rays, the leaves of the other plant are parallel 
to this direction. 

An other example is given by the budding of new roots. Some¬ 
times the rootlet will find a grain of sand just opposite itself. 
Than it will take some time before it has passed along that 
grain. In an other plant a young root will appear between 
some grains and at once it may start to grow. This are all fortui- 
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tous differences. For fortuitous I want to give the definition of 
Poincare: 

„Une cause tr£s petite, qui nous echappe, determine un effet considerable 
que nous ne pouvons pas ne pas voir, et alors nous disons que cet effet est 
du au hasard. Si nous connaissions exactement les lois de la nature et la 
situation de l’univers i Linstant initial, nous pourrions predire exactement la 
situation de ce meme univers & un instant ultirieur. Mais lors m&me que les 
lois naturelies n’auraient plus de secret pour nous, nous ne pourrons connaitre 
la situation initiate que approximativement. Si cela nous permet de prevoir la 
situation ulterieure avec la meme approximation , e’est tout ce qu’il nous faut, 
nous disons que le phenomene a 6te prevu, qu’il est regi pas des lois; mais il 
n’en est pas toujours ainsi, il peut arriver que des petites differences dans les 
conditions initiates en engendrent de trbs grandes dans les phenomenes finaux; 
une petite erreur sur les premieres produirait une erreur enorme sur les der- 
nieres. La prediction devient impossible et nous avons le phenomene fortuit. 
(Science ct Methode, pag. 68). 

It is clear that in biological events we have often such a 
disproportion between the differences in the causes and the 
differences in the effects: a very small difference in the direc¬ 
tion of the first named rootlet — and it would have grown at once. 

According to Poincare there is still another case where 
we speak of fortuitous events, i.e. if the causes are very complex. 
In that case the final situation is largely independent of the 
initial situation. But when are the causes complex enough to 
cause this effect, and when are they too simple to cause this 
effect? 

„On voit ainsi ce que Lon doi: entendre par conditions trop simples; ce sont 
celles qui conservent quelque chose, qui laissent subsister un invariant. Les 
Equations aifferentielles du probleme sont-^lles trop simples pour que nous 
puissions appliquer les lois du hasard? Cette question parait, au premier abord, 
denuee de sens precis; nous savons maintenant ce qu’elle veut dire. Elies son: 
trop simples, si elle conservent quelque chose, si elle admettent une integrale 
uniforme; si quelque chose des conditions initiales demeure inaltere, il est 
clair que la situation finale ne pourra plus etre independante de la situ¬ 
ation initiale. (Science et Methode, pag. 82). 

In biology we have this great complexity . of the causes. Not 
only there are a great number of causes that all influence at the 
same time one biological process, but, moreover, their influence 
is different at different times. 

§ 6 . Suppose that one plots, for two different plants, length 
against age. Probably the two ensuing lines will not coincide, 
for the speed of growth of the plants is hardly ever equal. But 
if they have equal hereditary qualities, practically equal con¬ 
stitutions, and if they are grown under practically equal external 
conditions, their speed of growth will not differ much. One 



85 


time the growth curve of the first plant will show a steeper rise, 
another time the second plant will grow faster. It is impossible 
to determine ’a priori’ which of the two plants will be larger at 
a certain moment, nor is it possible to predict which will be 
growing faster. But if at a certain moment, one plant is much 
smaller than the other, one may be sure that, accidently, the 
external conditions were very unfavourable for that plant (or 
that from the beginning its constitution was inferior) and that, 
moreover, the external conditions have caused an inferior con¬ 
stitution. Therefore one may expect that this plant will remain 
smaller, or at least that the time to overcome its handicap will 
probably be longer than the time to get it. 

On the points of intersection of the growthcurves the growth- 
rate of one of the plants is greater than that of the other. This 
may result from a difference in the constitution, a difference 
in the external conditions or from a difference in both. It is 
even possible that the plant which grows quicker has an inferior 
constitution but that its external conditions were more favourable 
(or the reverse). 

Will the constitution of the more active plant improve on 
that moment? Not even that is certain. If that plant grows 
quicker by reason of its superior constitution, while the external 
conditions were not so favourable, it is probable that its con¬ 
stitution will be influenced unfavourably. But even if it grows 
quicker as a result of better external conditions it is not certain 
that its constitution will improve. This is a consequence of the 
phenomenon of the limiting factors. Suppose that at a cer¬ 
tain moment the temperature is limiting factor. Consequently 
water and nutritive salts are sufficiently present. The plant 
which has a warmer place will grow quicker. If this plant 
has difficulties in pushing out its young root (e.g. caused 
by a grain of sand), its constitution will be inferior if water 
becomes limiting factor. It is therefore impossible to say ’a 
priori’ that the constitution of the faster growing plant im¬ 
proves. Often, however, the same factor is limiting at a cer¬ 
tain moment for all, or nearly all, processes. In that case the 
constitution of the fast-growing plant will improve. 

§ 7 . Suppose three plants a y b> c, with the same hereditary 
qualities and the same constitution when planted are grown 
under practically equal external conditions. If the length is 
determined at a certain moment and the plants are placed in 
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the order of their length, all six possible orders, abc, acb, bac, bca, 
cab, cba are equally probably ’a priori’. 

In two combinations out of six a plant has the same rank. 
Therefore the chance that a plant has a certain rank is 1/3. 
What is the chance that a plant a has a certain rank if the length 
of the two other plants is known? Then it is known which of 
the plants b and c is smaller and therefore has a lower rank, 
suppose b. The possibilities abc, bac and bca are now excluded. 
Only the possibilities acb, cab and cba remain. They are of course 
still equally likely. In each of them a has another rank. There¬ 
fore the probability of each rank of a is in this case also 1/3. 

The probability a priori of a certain rank of plants a at any 
moment is therefore, under the above-mentioned conditions al¬ 
ways 1/3, whether the length of the other plants at that mo¬ 
ment is known, or not. The same holds good for plant b or c. 

If a similar reasoning is applied to (n+i) plants of equal 
hereditary qualities, equal constitution when planted, and grown 
under equal external conditions, one finds that the probability 

’a priori’ for a certain rank for a certain plant is always —yy 

whatever is known about the lengths of the other plants. Even 
in the case that the length of the n other plants is known, the 
rank of the (n+i) th plant may have all values from 1 to n+i 

(inclusive). An the probability of any rank is still jyyy . If the 

n measured plants are arranged in otder of their fnagnitude the 

chance that the (n+i) th plant is smaller than the smallest is — 

1 n + 1 
that it is larger than the greatest is ^yy, and that its length 

lies between any two given plants which are nearest in rank 

is also yyy . The chance that its length lies between the length 

of two plants differing (g+i) in rank is (if two plants 

differ (g+i) in rank there are (g+i) intervals to each of 
which the (n+i)' h plant may belong). If the length of the plant 
limiting the group at the left side ’be lo, and the length of the 
plant limiting it at the right side be li, the above mentioned 
result may be expressed thus: the change that the length of the 

(n-|-i) th plant lies between lo and li is jjyy. 

How large is the probability that the length of the (n+i) th 
plant lies between Lq and Li, if not one of the n measured plants 
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has one of those lengths? Suppose that the length Li lies between 
the lengths of the plants with rank a and a+i, and that the length 
Lo lies between the lengths of the plants with rank a + k and 
a+k+i. Then the probability W that the length of the (n+i) th 
plant lies between L 0 and Li will be smaller than the probability 
that its length lies between the lengths of the plants of rank 
a and a+k+i. The probability W is larger than the proba¬ 
bility that its length lies between the lengths of the plants of 

rank a+i and a+k. The first named probability is 

n + 1 

the second Therefore: < w < 


Likewise: 


k—1 _k k + l 
n + 1 ^ n ^ n + 1 


If one takes — for W the absolute error is less than-". 

n n+1 

Bij taking n large enough this error may be made as small as 
needed. 

The relative error is: 



W 



n + 1 


2 

n + 1 _2_ 

k-1 “ k-1 

n + 1 


The relative error is therefore smaller than 


k-r 


It may now be endeavoured to determine which conditions 
are sufficient so that significance may be attached to a frequency 
curve. The argumentation is made simpler by considering the 
relative frequency curves, wherein the part of the total number 
of plants which have a certain length is plotted against that 
length. Such a frequency curve we will denote by (f.c.). 

It is clearly impossible to say that a plant has exactly a certain 
length, but only that its length lies between certain limits. In 
reality one plots the number of plants, or the part of the total 
number, against the size-classes. 


§ 8 ; A frequency curve is clearly without significance if the 
material gives two very different (f.c.)’s when divided at random 
in two parts (each containing of course a great number of indivi¬ 
duals). It is now possible to establish the conditions which are 
sufficient for the practical identity of the (f.c.)'s of two random 
parts of the material: 
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1. The individuals should have the same hereditary qualities, 

2. When planted they should have had the same initial consti¬ 
tution. 

3. They should be grown under practically equal external 
conditions. 

For if the first lot contains n individuals (n very great) and if 
there are g individuals between the lengths lo and li, the proba¬ 
bility that the length of any individual of the second lot lies 

between lo and h, will be w If the second lot contains also 

very many individuals, the (-f-) th part thereof will lie between 

the lengths 1 0 and li, according to the law of B e r n o u i 11 i. 
Consequently the same part of the whole material as in the first 
lot will be situated between those lengths. Therefore the two 
frequency curves are (nearly) identical. 

Consequently those conditions are sufficient in order that the 
frequency curve be of any significance. But are they necessary 
and are they realizable? The second question certainly should be 
answered in the negative; the condition that the plants should 
have the same constitution when planted cannot be realized. For, 
as already stated, the constitution of a plant at a certain moment 
depends upon the external conditions during its development up 
to that time; and sprouts as well as seeds have developped on the 
motherplant during a certain time. And even if the external 
conditions (sensu stricto) were absolutely equal for two sprouts 
or seeds (which never happens), even then their constitutions 
probably would be unequal, for their connection with the mother 
plant would not be absolutely equal. 

It may be proved that two sufficiently large groups will give 
the same (f.c.)’s if the differences in constitution are very small, 
so small in fact, that 'for no sprout (seed) one has a reason to 
suppose that its constitution is better, or worse, than for any 
other plant. (It is supposed of course that all plants have the 
same hereditary qualities and that they grow under practically 
equal conditions). 

Suppose that among the material, from which the sprouts are 
Chosen, occur a certain number of different constitutions. If the 
differences in constitution are so small as to be indetectable, it 
is possible to choose at random. (If they are not so small this is 
not always possible). Therefore if one selects twice a very great 
number of plants (from the same material), the proportion of the 
number of plants of one constitution to the whole number of 
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plants chosen will be nearly the same in both groups, according 
to the law of Bernouilli. The (f.c.)’s of plants of the same 
constitution are equal. In both groups the (f.c.)’s of the entire 
material are the sums of the (f.c.)’s of plants of the same (initial) 
constitution, multiplied by the proportion of the plants of that 
constitution to the whole material. As the partial (f.c.)’s for the 
two groups are equal, and as they are each multiplied by the same 
number, their sums, the total (f.c.ys, will be equal as well. 

It is thus proved that a frequency curve has only practical 
significance if: 

x. All plants have the same hereditary qualities. 

2. All plants were of a nearly equal constitution when planted. 

3. All plants have been grown under practically equal ex¬ 
ternal conditions. 

We shall call those conditions the conditions A. 

The foregoing reasoning could also be applied if the original 
material, i.e. the material from which the sprouts or seeds are 
chosen, consisted of sprouts or seeds of unequal hereditary qua¬ 
lities. It would be found, by a similar reasoning, that, if two 
groups were chosen at random from the same material, and 
were grown under practically equal external conditions, those 
two groups would give (nearly) equal frequency curves. But 
it may not be thought judiceous to give so much latitude to the 
first condition, for if the original sample of sprouts is exhausted, 
the experiment cannot be repeated, unless the composition of 
the material was exactly known, in which case one could have 
worked equally well with homogeneous material. 

If one considers a material with unequal hereditary qualities 
it will therefore be impossible to deduce general laws from the 
experiments. The same drawback does not hold for a mate¬ 
rial with small differences in the constitution, which involves 
gradual changes. Whenever the mother plants were not exposed 
to extremely variable external conditions the sprouts or seeds 
will have, in part, the same constitution. If one does not select 
extreme variants all the sprouts or seeds, therefore, should have 
practically equal constitution. 

§ 9. Our first problem is now completely outlined. Only 
when the conditions A are satisfied a frequency curve will have 
any quantitative value. Sometimes, however, the inverse pro¬ 
blem is given. One has a certain frequency curve, but does not 
know if the first of the conditions A (genotypic homogeneity) 
is satisfied. If the frequency curve is bimodal this may be a 
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result of genotypic inhomogeneity. But only by experiments 
genotypic inhomogeneity may be proved. For genotypic homo¬ 
geneous material may also give a bimodal frequency curve, as 
e.g. in the experiments of miss T a m m e s on the stalklength 
of Linum crepitans (K a p t e y n and v. U v e n p. 68). We 
have also found, when working with homogeneous material, 
a case of a bimodal frequency curve. The frequency curve of 
the difference in blade-length of the two leaves of the third pair 
of Kalanckoe verticillata S. Elliot was bimodal. 

To be able to answer the second question: 

„What may be concluded from given frequency curves?” one 
must at first ascertain with which aim they are composed. Some¬ 
times one wants to ascertain if there are reasons to suspect 
genotypic inhomogeneity. 

The frequency curves of a characteristic of plants with a 
different heredity will in general be different. Therefore the 
frequency curve of a mixture of those plants will be bimodal 
if the positions of the maxima of the frequency curves of plants 
of equal genotype differed enough. If the positions of those 
maxima differ but little, the frequency curve will be unimodal. 
Therefore bimodality may indicate genotypic inhomogeneity. 
Unimodality does not indicate homogeneity. 

In most cases however frequency curves are set up to ascer¬ 
tain if, for two groups of plants, the same conditions A are 
satisfied. This problem is usually given in this form: A certain 
characteristic is measured for two grpups of plants. It is known 
that the plants of both groups satisfy two of the three condi¬ 
tions A. Is it possible to ascertain from the frequency curves 
if they also satisfy the third condition? If the two curves are 
nearly identical the probability that all plants satisfy the 
third condition is very great. If the difference between the (f.c.)’s 
is very great this probability is very small. 

In other words, if it is known that 

1. There are no reasons to suspect that the sprouts or seeds 
had a different constitution when planted, 

2. The plants were grown under practically equal external 
conditions 

a great similarity between the two (f.c.)’s signifies that the plants 
of both groups were probably of the same heredity. A great 
difference between the curves signifies that the plants were pro¬ 
bably of different heredity. 

If it is known that 

i. The plants are all of the same heredity 
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2, Tlie plants were grown under practically equal external 
conditions 

a great similarity between the two (f.c.)’s signifies that the 
plants of both groups had probably the same constitution when 
planted. A great difference between the curves signifies that the 
plants had probably a different constitution when planted. 

If it is known that 

1. The plants are all of the same heredity,- 

2. The plants had all the same constitution when planted 
a great similarity between the two (f.c.)’s signifies that the plants 
of both groups were grown under practically equal external 
conditions. A great difference between the two curves signifies 
that probably the plants of the one group were grown under 
other external conditions than the plants of the second group. 

Usually the last-mentionned problem is given in this form: 

„It is known that all plants of two groups have equal here¬ 
dity, had equal constitution when planted and that the external 
conditions were all equal except one”. A great similarity in the 
(f.c.)’s of the two groups signifies in this case either that the 
factor vdiich was different in both groups had no influence 
on the process or at least that it was not the limiting factor for 
that process. A great difference in both curves signifies that the 
unequal factor has an influence on the process. 

The most important question which must be answered is 
therefore: It is possible that all plants of two groups satisfy the 
conditions A. The (f.c.)’s of a certain characteristic of the two 
groups have a certain difference. How great is the probability 
’a posteriori’ that the plants of both groups satisfy the con¬ 
ditions A? 

The solutions which are given for this problem generally are 
founded on certain hypotheses on the origin of frequency curves. 
We will discuss the most important of them. 

§ io. The oldest hypothesis about the form of the frequency 
curve supposes that all frequency curves are identical with the 
error curve of Gauss. As far as we know this hypothesis has 
been set up by Q u e t e 1 e t. There are two demonstrations 
that the frequency curve must be identical with the error curve, 
but both are based on incorrect suppositions. 

In the first demonstration (mentioned e.g. in Bauer, Verer- 
bungslehre p. 18) it is supposed that: 

i. A plant is influenced during its development by a great 
number of factors. 
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2. Each plant is influenced by the same number of factors, 

3. Each factor is either favourable or infavourable for a 
certain plant. 

4. The probability that a certain factor is favourable for a 
certain plant is always Yi- 

5. The length of a certain plant is a linear function of the 
number of favourable factors that influenced its growth. 

Two plants which were influenced by the same number of 
favourable factors will have therefore an equal length whether 
those factors were the same or not. 

Suppose that n factors influence the plants during their 
growth. The probability ’a priori’ that for a certain plant 
(a) factors were favourable and therefore (n—a) unfavourable is 

n! J_ 
a!(n—a)! 2" 

All plants for which (a) factors were favourable will have 
the same length (pa+q), (in which p and q are constants) accor¬ 
ding to condition 5. 

Therefore the probability that a plant will have the length 

fra+q) i s i!i^)T IT 

If there are many plants, the number of plants with the 
length (pa+q) will be </> the •-M th part of the total num- 

ber of plants, according to the law»of Bernouilli. If the part 
of the total number of plants which have a certain length (pa+q) 
is plotted against that length and if those points are joined 
by a smooth line the theoretical (f.c.) is obtained. In this case 
the curve is called „binomial curve”. 

If n is very large the binomial curve approximates the error 


k — k*(M—x) J •' 

curve of G a u s s y = -7= e , at least if the units for the 

V 71 


abscissa and the ordinate are suitably chosen. In this formula M 
is the mean length of all. plants, x is the length of an individual 
plant, and k depends on the unit of measuring. 

It is clear that the fundamental suppositions are hardly ever 
realized. The supposition 4 (the probability that a certain factor 
is favourable for a certain plant is always Y) implies that 
all factors are independent. This is very improbable. 

But supposition y (the length of a plant is a linear function 
of the number of favourable factors) is in direct contradiction 
with the phenomenon of the limiting factors. E.g. suppose there 
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are only two factors. Each factor may be favourable or un¬ 
favourable; A (favourable), a (unfavourable), B (favourable), b 
(unfavourable). There are four possible combinations: AB, Ab, 
aB, ab. According to hypothesis 4 these are equally probable. 
According to hypothesis 5 the length of the plants would be 
respectively (ip+q), (p+q), (p+q), (q). 

But if in the cases AB and Ab the second factor is limiting, and 
in the cases aB and ab the first, the length of the plants will 
be respectively (2p+q), (p+q), (q), (q). The resulting frequency 
curve, in this case, will be non-binomial. This reasoning is there¬ 
fore incompatible with reality. 

§ii. There is still another reasoning to prove that in biology 
the frequency curve should be the error curve of Gauss. First, 
the whole growth period is divided into a large number of ele¬ 
mentary periods. It is supposed that the increase in length of a 
plant during an elementary period depends only upon the external 
conditions during that period; furthermore that the probability 
of a certain constellation of the external conditions at a certain 
moment is equal for all plants. This equality implies that the 
probability of a certain constellation for a certain plant does 
not depend upon the constellation during a former period! 
Finally it is supposed that the mean increase of all plants during 
an elementary period is equal to the median increase. 

The total length of a plant is the sum of the increases in 
length during the elementary periods. Such an increase in length 
may be conceived as the algebraic sum of the mean increase in 
length and the deviation from that mean increase for that parti¬ 
cular plant. The total length of a plant, therefore, will be equal 
to the mean length of all plants plus the (algebraic) sum of 
its deviations from the mean increase during the elementary 
periods. The hypothesis that the mean increase during an elemen¬ 
tary period is equal to the median increase implicates that the 
probability of negative and positive deviations from that mean 
are equal. Therefore the length of a plant is a constant (the 
mean length of all plants) plus the sum of a great number of 
quantities which are as likely positive as negative. The proba¬ 
bility that a certain plant has a definite deviation from the 
mean length of all plants is therefore equal to the probability 
that the sum of the quantities - aforementionned has a certain 
value. It has been proved by Bessel that this probability follows 
the law of Gauss. Therefore the frequency curve of the length 
of the plants will also be equal to the error curve of Gauss. 
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There are two serious objections against the hypotheses upon 
which this deduction of the form of the frequency curve is based; 

1. The supposition that the increase in length of a plant de¬ 
pends only upon the external conditions will be rarely true. 
If two plants haive a different constitution their increase in 
length during an elementary period will be different if the 
external conditions are equal. 

And generally the constitutions of two plants are different. 
Therefore the action on a plant of a certain constellation of 
the external conditions will depend upon the constellation of those 
conditions at previous moments as this constellation has in¬ 
fluenced the constitution of the plant. 

2. But, moreover, the constellation of the external conditions 
during successive periods will not be independent as those con¬ 
ditions change gradually. 

Therefore the suppositions on which this deduction of the 
form of the frequency curve is based are not realized in nature. 

§ 12. Why has it so often been tried to prove this identity 
with the error curve of Gauss? The reason seems to be pri¬ 
marily historical. About 1835 Quetelet had observed that 
the frequency curve of the breast circumference of Scottish 
conscripts was nearly identical with the binomial curve, which 
passes into the error curve when the exponent n becomes very 
large. Quetelet says: 

L’exemple que je vions de citer (viz. the* frequency curve of the breast 
circumference of Scottish conscripts) merite je crois notre attention: il 
nous montre que les choses se passent absolument comme si les poitrines 
qui ont iti mesurees avaient hi modelees sur un merae type, sur un memc 
individu, ideal si Ton veut, mais dont nous pouvons saisir les proportions, 
par une experience suffisamment prolong£e. Si telle n* 4 tait pas la loi de 
la nature, les mesures ne se grouperaient pas, malgr£ leurs d£fectuosites 
avec P^tonnante sym£trie que leur assigne la loi de possibility (Quetelet*, 
1846, p. 137). 

In point of fact Quetelet gives only an experimental justifi¬ 
cation of the identity of the frequency curve and the binomial 
curve. Of the modern writers E. B o r e 1 shares this point of 
view (E. Bor el, 1924. p. 127, p. 178). As soon as it is shown 
that often this identity does not exist the reasonings of Que¬ 
telet and B o r e 1 lose their value. And many frequency curves 
are found which are no binomial curves. 

How is it then that in computing the results of biological 
experiments the methods used are generally based on the normal 
law of error, that is to say: it is supposed ’a priori 5 that the 
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frequency curve is identical with the error curve of Gauss? 
Probably this is a consequence of the fact that most biologists 
forget that in biological experiments deviations from the mean 
usually are caused by the variability of the material. Those 
deviations are therefore not true errors, as the errors in phy¬ 
sical experiments for instance. But it is usual to give them the 
name of error and so it is forgotten what they really are. 

In answer to the objections of Bertrand (Bertrand 1907, 
p. 170 et sqq.) Poincare gives a very general proof of the 
identity between the error- and the Gaussian curve. He puts 
the problem in the following way: Sunnose z is the real value 
of the quantity which is measured. How large is the probability 
that the result of a measurement will fall between (x) and 
(x-\-dx)? Without any hypothesis this problem is insoluble. Poin¬ 
care supposes that: 

1. This probability is proportional to dx . 

2. The mean value of x is equal to the probable value of z. 

He puts the probability that, the true value being z the result 

of a measurement will fall between (x) and (x-\-dx) 

dW — <f> (x,z) dx 

Clearly y = $ (x y z) is the equation of the error curve. 

Which meaning may be attached to (x), (z) and y = <£ (x y z) in 
variation statistics? 

x is clearly the length of a single individual. In § 7 it has 
been demonstrated that a frequency curve has only any prac¬ 
tical value if the material satisfies the conditions A. In that case 
a repetition of the experiment will give a practically identical 
frequency curve. If the (f.c.) J s of two groups of plants are 
equal the mean length of those plants will also be equal. 

If the external conditions had been different probably the 
maxima of those (f.c.)’s would have been at another place. Then 
their mean length would have been different. Therefore the mean 
length may be taken as a measure for the external conditions. 
In that case y = <f> (x y z) is the form of the frequency curve if the 
external conditions were such that the mean length of the 
plants is (z). 

We have now identified our problem with the problem of the 
form of the error curve as put by Poincare. Therefore the 
solutions of both problems should also be equal. P o i n c a r 4 
finds for the formula of the error curve 

(x,z) = 6 (x) e — J t ( 4 ( z ~x) dz. 

B(x) is a function of x only, which indicates to what extent 
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some values of x have a larger probability than other values, 
irrespective the value of z. E.g. in discontinuous variability 9 (x) 
will be zero for any fractional value of x, and it will be equal to 

for any integer value of x. In continuous variability 9 (x) 
generally will be constant. 

if/ (z) is the probability ’a priori’ of a certain value of z. In the 
calculus of errors 4 > (x,z) signifies the probability of committing an 
error (x-z). Poincare reasons that the probability ’a priori’ of an 
error will not depend upon the probability ’a priori’ of the mag¬ 
nitude of the measured quantity. Generally this will be true, but 
not always. E.g. the error made when the place of a star low at 
the horizon is determined probably will be greater than the error 
made when the star shines high above the horizon. But if one 
accepts the hypothesis of Poincare that the probability of an 
error does not depend upon the probability ’a priori’ of the mag¬ 
nitude of the measured quantity, then the error curve will be the 
curve of Gauss multiplied by 9 (x). For in that case f (z) is 

constant and f f (z) (z — x) dx — k? (z — xf . 

(z — x) is the error. 

In variation-statistics <t> (x,z) is a measure for the absolute 
variability. But it is certain that the absolute variability depends 
upon the mean length of the plants. The absolute variability of 
oaks hundred years old will be larger and much larger than the 
absolute variability of oaks a -hundred days old. Therefore, in 
the case of variation-statistics, one may not put •A (z) = constant, 
but then the frequency curve will not be the error curve of 
Gauss. Only in cases where the absolute variability does not 
depend upon the mean length it may be expected that the form 
of the frequency curve becomes identical with the form of the 
error curve of Gauss. 

§ 13. Rather early in literature it was observed that there are 
cases where the normal curve does not fit a given frequency 
distribution. Pearson has given a number of equations to fit 
any given frequency distribution: Though the original idea was 
to generalize the binomial curve by supposing that not all factors 
were independent, the constants of the found frequency curves 
have often such a value that a biological interpretation is rarely 
possible. Virtually all those frequency curves are only graduation 
curves as Pearson has now admitted in a letter to the Editor 
of ’Nature’ (24 Aug. 1935 p. 296). A serious drawback to all 
such graduation curves is the absence of any causal relation between 
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the constants of the curve and the factors that have caused the 
frequency distribution. It is possible that a small difference in 
those factors brings about a great difference in the constants of 
the curve, or even a difference in the formula, thougth the form of 
the frequency curve has changed but little. 

§ 14. Edgeworth and Kapteyn have used another 
method to determine the form of the frequency curve. Their 
startingpoint is the thesis that a non-linear function of a quantity 
which is distributed according to the normal or Gaussian law 
cannot be distributed according to that law. Bertrand has also 
made this remark. He based upon it a severe criticism of the 
error curve of Gauss. Pearson criticizes the method of 
Kapteyn and Edgeworth. According to Pearson the 
relation between two quantities is, in biology, never causal but 
only correlational. His illustration of this remark gives the im¬ 
pression that he is trying to split straws. He says (Biometrika 4, 
1905 p. 199): 

The sagittal arc in English women is almost exactly normal in its distri¬ 
bution and nasal breadth is very asymmetrical. Shall we take x sagittal arc 
and X nasal breadth and make x = F (X) — M ? 

Anyone who has read the paper of Kapteyn wil know that 
Kapteyn means causal relation as e.g. in the case of the dia¬ 
meter and volume of spherical 'berries. If the frequency curve of 
the diameter of those berries is a Gaussian curve, the frequency 
curve of the volumes should be asymmetrical. If the range of the 
diameter is large the asymmetry will be very pronounced. 

That the frequency curve of the diameter and the volume of 
a group of nearly spherical berries are never at the same time 
Gaussian curves is easily demonstrated by an example. Suppose 
there are n berries with a diameter between 1 and 2, and also 
n with a diameter between j and 6 (at both sides of the maximum 
of the frequency curve). The volumes of the first-mentioned berries 

will lie between -|-ir and 8. The volumes of the other group 
4 4 

will lie between—ir. 125 and—w. 216. Suppose the class width of 

the volumes is -|-w. 7. The first named n plants will all fall into 

one class, the other n (whose diameter lies between j and 6 ) will 
fall into 13 classes. Therefore the frequency curve of the volumes 
will be skew. It is clear that in this case the quantities which 
follow the normal law are no shadow quantities (Pearson 
p. 201). 
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In the same paper Pearson gives — • ~ =-t^-. as the 

y ax a 0 ‘r(xj 

only form of the differential equation of the frequency curve. 
As Kapteyn.has shown (Kapteyn 1916) this is simply the 
Edgewort h—K a p te y n formula in the differential form. 
Only the interpretation of F(x) is different. Pearson calls J/F(x) 
the standard deviation of the ’instantaneous Gaussian curve’. 
Edgeworth and Kapteyn suppose that there is a quantity 
y = F (x) which is distributed according to the normal law. The 
difference is thus only a question of the interpretation of F (x). 

There is however one point in the theory of Kapteyn which 
hardly has been discussed by Pearson and this is its most 
important part. As we have seen the supposition that the increase 
in length of a plant during an elementary period depends only 
upon the external conditions, and that the external conditions on 
a certain moment are independent of the conditions during any 
previous moment, leads to a normal or Gaussian frequency curve. 
The fault in this reasoning is that it does not take into account 
the constitution of the plant. Kapteyn supposes that the 
length is a good measure for the constitution of a plant. The 
growth rate of two plants of the same length should only depend 
upon the external conditions. If the external conditions were 
equal for two plants the growth rate would only depend upon 
the length of those plants. From those hypotheses and the additional 
one that the external conditions on two successive moments are 
independent, Kapteyn is able to deduce the form of the 
frequency curve. 

_ h F 1 (x) „ — h* [F (x) — F (x 0 ) — M] 2 
y |/TT 

is the growth rate of a plant of the length x (k denotes the 

magnitude of the external conditions). From a given frequency 
curve F (x) may be deduced, as Van Uven explains on p. 35 
(Kapteyn and Van Uven 1916). By graphical differentiation 

F (x) and —^ may be deduced from F (x). If the suppositions 

of Kapteyn were true it would thus be possible to determine, 
from i given frequency curve, how the growth rate of a plant 
depends upon its length Under constant external conditions. But 
the fundamental supposition of Kapteyn, that the constitution 
of a'plant is unambiguously determined by its length is certainly 
not , correct. The constitution of plants of the same length but of 
different ages is different. But if the constitution is different the 
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growth rate will be different as well. If the growth velocity as 
function of the length is calculated at different moments according 
to the method of Kapteyn and v. U v e n, the results will be 
different, as happened in our experiments (p. 1x9). 

There is still another objection against the deduction of Kap¬ 
teyn. The external conditions at two consecutive elementary 
periods are supposed to be independent, and this never happens. 
Almost all external factors change gradually. If the soil around 
a plant is humid, it will not be dry some moments later. The 
reverse may sometimes happen but than the soil will remain more 
or less humid for some time. Neither does the temperature change 
abruptly. And even the light will usually alter gradually. Only 
at certain moments when the sun appears suddenly from behind 
the clouds, the changes in the intensity of the light will be very 
large. Though theoretically these objections are very grave they 
have probably no influence on the results of Kapteyn. For 
it is possible to replace the supposition that the external conditions 
on consecutive moments are independent by another hypothesis 
which is a better image of nature, and still obtain the same result 
as Kapteyn did. 

Suppose that during a certain time factor a is limiting. The 
magnitude of that factor may have 2 k different values. We 
shall suppose that the growth rate is a linear function of the 
magnitude of factor a. If the magnitude during a certain 
elementary period is g we will suppose that the probability that 

during the following period it is (g—1) is and that during 

the following period it is (g+x) is k a . If k is very large the 

magnitude of the factor a changes gradually. A further advantage 
of this hypothesis is that deviations from the mean of the 
magnitude of the factor tend to become smaller. If the magnitude 
of a limiting factor changes always in this manner from element¬ 
ary period to elementary period the calculations of Kapteyn 
and van U ven still hold good. (For the calculation vide p. 123). 

Our general conclusion on the theories which explain the form 
of the frequency curve may be stated as follows. 

The growth rate of a plant depends upon its heredity, its 
constitution at all moments of its growth and also depends upon 
the external conditions. The constitution of a plant depends upon 
its age and on its previous history. Therefore it is impossible: 

1. To determine ’a priori’ the form of the frequency curve. 
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2. To deduct ’a posteriori’ the relation of length and growth- 
rate from the form of the frequency curve. 

§ 15. As we have said the problem of the interpretation of 
frequency curves is usually given in this form: „The frequency 
curves of the same characteristic of two groups of plants are given. 
It is ’a priori’ possible that for both groups the conditions A 
were the same. How large is the probability ’a posteriori’ that 
this equality of the conditions A has existed?” 

The school of ’Student’ and Fisher formulates this problem 
in another way. They consider a given frequency distribution as 
a random sample from a very numerous assemblage of plants. 
It is supposed that all those plants satisfy the conditions A. 
Such an assemblage they call ’Universe’. If a random sample 
is taken from that ’Universe’ how large is then the probability 
that the mean of the sample differs a certain number of times 
the standard deviation (of the sample) from the mean of the 
’Universe’. To answer this problem the law underlying the fre¬ 
quency distribution of the ’Universe’ must be known. If the 
means of two given frequency curves differ much, the probability 
that they are random samples of the same ’Universe’ will be very 
small, independent of the way in which the composition of this 
’Universe’ is chosen. In that case it is very improbable that the con¬ 
ditions A were satisfied for both groups. But if the difference 
between the frequency curves is not large it will be possible to 
find such a composition of the ’Universe’ that it is not improbable 
that both are random samples from that ’Universe’. If it was 
*a priori’ possible that the conditions A were satisfied, one mav 
now conclude that it is very probable that they were satisfied. 

Is it possible to attach any biological significance to the notion 
’Universe’, and in how far is it possible to regard a given frequency 
curve as a random sample from that ’Universe’? As we have 
demonstrated, if the conditions A are satisfied, the (f.c.) of any 
characteristic will tend to a certain limit if the number of 
individuals is very great. This limit-(f.c.) may be regarded as 
a description of the law underlying the frequency distribution 
of the ’Universe’. And if one. has only measured a limited number 
of individuals those may be regarded as a random sample from 
that ’Universe’. It is thus possible to attach significance to the 
notions ’Universe’ and ’random sample’. But why are those notions 
necessary? 

The whole theory of errors is based upon the supposition that 
there are a great number of observations. (As we remarked already 
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this theory is in an erroneous way also applied by biologists, 
though here the cause of discordance between the observations 
lies in the variability of the material). As in general the real 
value of the quantities measured and of the errors is not known, 
they are simply equalized to the mean value of the observations 
etc. According to the law ofBernouilli the difference between 
the probable and the mean value of a quantity tends to zero if 
the number of observations tends to oo. Therefore in this case 
(n^oo)the probable value of the observed quantity and of the 
error are known with great accuracy. If the real value of mean 
and probable error are known it is possible to determine the 
probability that, in a new set of observations, the mean will 
differ a certain quantity from the original mean. But if the number 
of observations is not very great all foregoing reasonings lose 
their value and it is impossible to determine with any accuracy 
the probable value of the observed .quantity and of the probable 
error. But then it is also impossible to determine the probability 
that in a second set of observations the mean will differ a certain 
quantity from the original mean. This inconvenience does not 
exist in the method of ’Student 5 and Fisher, for they work 
only with the mean and standard deviation of the sample, which 
are entirely known, and with the mean of the ’Universe 5 , which 
is given ’a priori 5 . This seems a great improvement but....there 
is one great drawback. To apply this method the law of the 
frequency distribution of the ’Universe’ must be known ’a priori’, 
and in biology this rarely, if ever, happens. Therefore in biology 
the method of ’Student’ and Fisher seems in general to be 
unapplicable. 

§ 1 6. It seems thus that there is no unobjectionable method 
to compare frequency curves by calculation. A direct comparison 
has also great drawbacks, a difference between two (f.c.)’s ia 
the neighbourhood of their maxima has another significance as 
a difference at their ends. Particularly when there is ’high contact’. 
Furthermore the form of the (f.c.) is generally rather complicated, 
differences in form are therefore difficult to observe. The most 
practical method is therefore to compare the deviations of the 
(f.c.)’s from any well-defined standard curve. This curve should 
have the following properties: 

i. Its range must be unlimited. It is true that the range of a 
biological characteristic is never unlimited, but at the other hand 
it is hardly ever possible to state of any individual that the 
characteristic has a definite maximum value. 
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2. Its ordinate should be known or should be easily calculated 
for each value of the a'bcissa. 

3. Its form must not deviate too much from the general 
form of (f.c.)’s, otherwise the curves of the deviations from the 
standard curve will have a complicated form, and then our goal 
is not reached. The error curve of Gauss satisfies all those 
conditions. Therefore the best method of comparing (f.c.) 5 s is to 
compare their deviations from the error curve of Gauss. But 
the method of doing this is equal to the method of Kapteyn 
and van Uven to determine the function 2 = F (x) which is 
distributed according to the normal law described in their second 
paper (Kapteyn and v. Uven 1916). In the same paper the 
formula of the probable error of the ordinate z of the curve F ( x ) 
is given. It is thus possible, of two (f.c.)’s are given, to determine 
if their abscissal distance between points of equal ordinates is at 
all points less than the probable error, in which case conditions 
A were probably satisfied, or if their ’distance 5 is at any point 
more than three times the probable error, in which case the con¬ 
ditions A were, probably, not satisfied. If the form of the two 
curves differs much it is also ivery improbable that the conditions 
A were satisfied. In this case, moreover, generally some points 
of the second curve will deviate more than three times the 
probable error from the corresponding points of the first curve. 

II. STATEMENT OF THE PROBLEM, MATERIAL AND 

METHODS. 

§ 1. In the introduction it has been demonstrated that it is 
impossible to determine 5 a priori 5 the form of the frequency curve 
of the length of plants. The demonstrations that the frequency 
curve must be identical with the error curve of Gauss were 
unacceptable because they were based upon suppositions which 
are not realized in biology, e.g. the constitution of the plants is 
not taken into account. And even the theory of Kapteyn, 
which is a great improvement (for it takes this constitution into 
account) is unacceptable; its measure for the constitution is in¬ 
correct. We have tried to demonstrate experimentally that those 
objections against the theory of Quetelet c.s. and of Kap¬ 
teyn are of practical importance. 

The easiest method to demonstrate the validity of our objections 
against the theory of Quetelet c.s. is to prove that the conclusions 
are faulty. Therefore it is necessary to determine the (f.c.) of any 
characteristic (e.g. the length) of a number of plants satisfying 
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the conditions A. According to Quetelet one should find the 
error curve of Gauss. In none of our experiments this happened. 
Neither have we found in the litterature any example of the 
Gaussian curve if the conditions A were satisfied (e.g. Johann- 
sen 1903. Pearl 1907. Dennert 1926). 

A second method to justify our objections is to demonstrate 
directly that the growth-rate depends upon age and previous 
history. For this method it is necessary to measure regularly the 
length of a great number of plants which satisfy the conditions A. 
In this manner it is possible to determine the growth-rate of 
plants of equal age but of different length and of plants of 
different age and of equal length. According to the adherents 
of the theory of Quetelet the growthrate must be independent of 
age and of the length of the plants. According to Kapteyn 
it must be independent of age but may depend upon the length 
of the plants. It will appear that it depends upon both. 

A third method is to compare the (f.c.)V of a certain charac¬ 
teristic if the plants are classified according to another charac¬ 
teristic. If the magnitude of a certain characteristic is different 
for two groups of plants their previous history will have been 
different. But then their constitution is unequal at that moment. 
If after that moment another characteristic develops it will 
originate from plants with different constitution. Therefore the 
(f.c.) 5 s of the two groups of plants will be different for the second 
■characteristic. * 

In our experiments we have measured the breadth of the first 
leaf a short time after the sprouts of Kalanchoe were planted. 
There existed practically no stem at that moment. But even at 
.the concluding measurement there was an obvious difference in 
the frequency curves of the length of those plants, whose breadth 
of the first leaf was different some 8 months ago. 

§ 2. To execute those experiments it is necessary to have a 
number of plants satisfying the conditions A. The first condition 
is hereditary homogeneity . There are two ways to obtain this 
homogeneity. Either one should measure the same organs of one 
plant, or one should measure the length of plants which are the 
vegetative ofspring of one mother plant. (Virtually those sprouts 
may also be regarded as the same organs of one plant). We have 
followed both ways. 

§ 3. In the first place we have measured the length and the 
breadth of the foliola of full-grown leaves of Robinia Pseudacacia 
L. The experimental plant was a shrub growing at a distance 
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of about 2 M. from some other shrubs which screened it against 
the W. and N. winds. At the E. and S. side it was screened by 
wood. The shrub originated probably from the root of a Robinia- 
tree some 4 m.* distant. Its age was about 4 years. The leaves 
were gathered and measured in the last week of August. It was 
rather difficult to determine whether a leaf was full-grown. But 
as the total length of the full-grown leaves increases from the 
base to the apex of each branch, those leaves were considered 
full-grown which were not smaller than their predecessors. 

The foliola were measured by placing them on a piece of ruled 
paper. The distance of the lines was 1 mm., 3^2 mm/$ were 
estimated. The number of observations ending in entire mm/s 
was twice the number ending in Yi mm.’s. This is of cours caused 
by a faulty estimation. D e n n e r t (working with the needles of 
Taxus baccata L) has probably made the same mistake, but he 
ascribes the differences to a real fluctuation or ’Rythmus’ of the 
frequency distribution. (D e n n e r t 1926, e.g. fig. n). For the 
greater number of computations we have therefore joined two 
succeeding classes to one class. 

§ 4. Our second set of observations were made on Kalanchoe 
verticillata S. Elliot, a Crassulacea. Kalanchoe verticillata is a 
herbaceous perennial plant. Its lower leaves are decussate, but 
higher on the stem the leaves alternate. At the top of the leaves 
there are 4-12 dents. On these dents the sprouts are formed. 
These sprouts consist of two leaves placed on a ’disc’. The smallest 
of those leaves is directed toward the mother plant. (Fig. 1). 
If those sprouts are not too young they are easily separated from 



i\ 1 ■ 


Fig. 1. 1. Apex of leaf of Kalanchoe verticillata S. Elliot. A. ’Dent’- 

B. Sprout. C. Blade of motherplant. Nat. Size. 

2. Sprout. D. Smallest ’cotyledon’. E. Largest ’cotyledon’. F. Apical 
bud. G. ’Disc’, x 2. 
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the mother plant. If such a detached sprout is laid on the soil the 
disc forms roots and the bud between the leaves develops into a 
stem. The first year the plant generally bears no flowers. As long, 
as the apical bud exists the lateral buds do not develop. 

In December 1933 450 sprouts originating from one plant were 
placed on moist sand. In march 1934 each young plant was 
placed in a flowerpot. Those flowerpots were placed in a hot 
bed. By the beginning of July the length of the stems was 
measured, and the number' of leaves counted. As the plants did 
not all grow on the same distance from the glass their external 
conditions were not practically equal. Therefore those frequency 
curves have no very great value, but they were not Gaussian. 

9 6 of those plants which showed about the same length were 
used as mother plants for our definitive experiment. From the 
apex of three full-grown leaves of each of those plants twa 
sprouts were taken. These sprouts- were planted in 48 wooden 
boxes. In each box 12 plants were placed deriving from two 
mother plants. 

The wooden boxes had an internal diameter of 5 X 5 X 55 cm. 
Five holes were drilled in the bottom for draining purposes. On 
the bottom was a layer of well-washed gravel about 1 cm. 
high. The boxes were filled with sand which was washed seven 
times in ordinary water. 

They were placed in eight rows of six before the window 
of a room on the S. side of the laboratory. All boxes stood 
parallel to the window, the rows were perpendicular to the 
window. The temperature of this room was not constant, but 
it sunk never below 4°C. Only during the end of December and 
the beginning of January it was necessary to heat the room. 

As the posterior boxes were further from the window every 
other day all boxes were displaced. If the original order was 
1, 2 > 3) 4) 5> 6 then after the displacement the order was 6 , 
1, 2 > 3s 4> 5- Moreover each box was turned 180° and each 
row was displaced as a whole one place to the right. The row 
at the extreme right side being placed at the left side. 

If the soil was dry the boxes were placed in water which was 
sucked up through the holes in the bottom. About once in six 
weeks a solution of v. d. Crone (to which was added traces 
of H 3 BO4 and of Mn SO4) was used instead of ordinary water. 
If the weather was very sunny the plants were, moreover, 
sprinkled with water. 

After two weeks two of the boxes were upset and their plants 
spoiled. There remained therefore 4 6 boxes containing 552 
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plants. Of those plants one has died, all other developped more 
•or less. The experiment was terminated on the 4'“ of July 
1935. On October 24 the breadth of the largest of the first leaves 
(which have developped on the mother plant) was measured, with 
the aid of measuring calipers. The accuracy of this instrument 
was 0.0 j mm. At November 16, January 11, February 6, 
March 11, March 27, April 30, May 15, June ix, July 5 the 
length of the plants was measured from the lowest leaves to 
the terminal-bud. A piece of ruled paper was placed besides the 
plants on which the length was read. Tenths of mm.’s were 
estimated as long as the plants were lower than 3 cm. From 
plants which were higher than 3 cm. only the mm.’s were 
noted. 

On December 10 the length of the leaves of the third pair 
(when counted from the bottom) were measured. 

On November 16, January 16 (F), February 13 (H), March 21 (L), 
April 8 (N), May 2, May 21 the number of full-grown leaves of 
each plant was counted. Sometimes a plant dropped one or more 
leaves but the leafscars, which were always readily recognizable, 
were then counted as leaves. It was not easy to determine what 
should be reckoned as the highest full-grown leaf pair, for the 
terminal-bud continually forms new leaves. We reckoned those 
leajves as full-grown which did not touch another. 

How far did our plants satisfy the conditions A? The first con¬ 
dition was certainly satisfied for all ^plants had the same here¬ 
dity, because they all originated from the same P 2 . Our second 
condition was that the plants must have had a practically 
equal constitution when planted. Although all the mother plants 
grew under practically equal conditions the constitution of the 
sprouts was not equal. For the age of the sprouts, when detached 
from the mother plants was probably unequal, as it was very 
difficult to determine if the leaves of the mother plants on 
which they grew were just full grown. The older a sprout the 
more reserve food it has stored, and the better its constitu¬ 
tion (and inversely). In all those cases in which the quantity 
of food stored in sprouts or seeds is small, this quantity will 
chiefly -determine the constitution of those sprouts or seeds. 
For this food has to supply the energy to form roots. And the 
mere and the stronger roots are formed, the better the plant 
will grow. But if the. food in the sprouts or seeds is super¬ 
abundant all plants may form their roots equally well, and 
the differences in the quantity of the food do not cause a dif¬ 
ference in constitution. 
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As sprouts formed on leaves are generally small their quan¬ 
tity of reserve food (and therefore their magnitude) will gene¬ 
rally be a measure for their constitution. The best manner to 
obtain plants of equal constitution would be to weigh them, 
but as they are very small this would take much time. It 
would be impossible to plant many sprouts on the same day, 
in other words the plants would not have all the same age. 
The second best method to select a great number of plants of 
the same constitution is to measure, some weeks after the sprouts 
are planted, and before the stem develops, the breadth of the 
largest leaf. If one chooses now for the experiment only those 
plants whose leafbreadth was nearly equal, they will probably 
have a nearly equal constitution. We 'have followed more or 
less this way in our experiments. 

The third of the conditions A was that the plants should have 
been grown under practically equal external circumstances. By 
practical equal external circumstances was meant that for no 
period it was possible to indicate a factor which was more 
favourable for one as for another plant. By our regular dis¬ 
placements each box occupied all possible positions in rela¬ 
tion to the windows, in the course of time. As far as light and 
temperature is concerned the conditions may therefore be desig¬ 
nated as practically equal. As the plants were always watered 
on the same day the 'humidity and the supply of mineral 
salts was also practically equal. 

The only question that ‘has not been answered is therefore: 
„Has Kalanchoe verticillata a susceptible period in its growth?” 
We have not found any indication of it, and as the growth 
is a continuous process it does not seem probable. It is clear 
that the susceptible period will have only any influence on 
the form of the frequency curve if the external conditions are 
not constant. For two plants generally are not at the same 
time in a susceptible period, and if the external conditions are 
not constant, those plants will be influenced during their sus¬ 
ceptible period by different external conditions. And then their 
constitution will be different during the remainder of their 
life. To eliminate this possibility we have tried twice* to grow 
Kalanchoe verticillata under constant external conditions. The 
first time we raised the plants in watercultures in a constant 
temperature room in the Botanical Laboratory at Leyden. But 
all plants died in consequence of an fungous infection. The se¬ 
cond time we raised the plants on sand cultures, also in the con¬ 
stant temperature room. The plants were constantly illuminated 
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by six „ window case” lamps, for if the susceptible period should 
happen to be very short or intermittent with a period of 24 
hours, the difference between light and dark would have in¬ 
fluenced the constitution of the plants. Moreover if intermit¬ 
tent light was used it would have been necessary to use very 
strong lamps,' to give the plants a quantity of light equivalent to 
the daylight. But such strong lamps give so much heat that it is not 
possible to keep the temperature of the room constant. 

This constant illumination however, has caused a luxuriant 
growth of Cyanophyceae, which so hardened the soilsurface 
that the plants died. So this second experiment under constant 
external conditions did also fail. 

Summarizing our discussion of the material and methods we 
may conclude that: 

Kalanchoe verticillata has some great advantages for this kind 
of investigations. It is very easy to get any number of sprouts 
of equal heredity. The plants grow well on sand if occasionally 
a v. d. Crone solution is given instead of water. It is easy to 
measure the plants as they do not branch as long as the ter¬ 
minal-bud is uninjured. And finally this species does not make 
high demands as to temperature (which in our case varied be¬ 
tween 4 0 and 25 0 C.) and humidity. 

Kalanchoe also has some drawbacks. It is very difficult to 
obtain sprouts of equal constitution. Moreover they are not easy 
to grow on water cultures, and only if plants are grown on 
water culture the external conditions (at least as far as the 
roots are concerned) are practically equal with certainly. 

III. A BRIEF SURVEY OF THE RESULTS. 

1. The results of the measurements of the foliola of Rohinia 
Pseudacacia . 

How far do the leaves of a plant satisfy the conditions A? 
If the plant is not a periclinal chimera all leaves should have, 
of course, equal heredity (bud variations are very rare). So 
the first condition is satisfied. 

The second condition was that the sprouts or the seeds must 
have had an equal constitution when planted, that is to say 
when they began to grow independently. But does an organ 
of a plant grow independently? Most organs have a certain 
autonomy, but still their connection with the rest of the plant 
is of primordial importance. And when does a leaf for instance 
begin to grow ’independently’? Perhaps one may choose the 
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moment on which the cells are formed which will only develop 
into a leaf as the moment on which the leaf begins its ’inde¬ 
pendent 5 existence. But in that case there are no reasons to sup¬ 
pose that those newly-formed cells have not always practi¬ 
cally the same constitution. So the second condition is also sa¬ 
tisfied for the leaves of one plant. 

It is not so certain that the third condition, equality of the 
external circumstances, has been satisfied. For, in general, the 
leaves formed in spring will have been under other external 
conditions during their development as the leaves formed in the 
summer. But at the other hand the climate in Holland is so 
variable that there are periods in the summer whidh are less 
favourable for the development of leaves than some periods 
in spring. It is therefore not possible to say 5 a priori 5 that a well 
defined leaf (e.g. the 4 th leaf of the third branch) has grown 
under more favourable circumstances than any other leaf. But 
as this was our condition of practical equal conditions we may 
conclude, if with a slight hesitation, that the third condition 
was also satisfied in this case. 

268 leaves, with 3929 foliola, were measured. We have first 
put the results of our measurements in a correlation table 1 ), 
(length of the foliola against breadth). As the terminal leaflet 
occupies a position apart we have not classed them with the 
other leaflets in the same table. The two tables (of the paired 
and of the unpaired leaflets) differed indeed. If the length of a 
paired leaflet was equal to the length of an terminal leaflet its 
breadth was generally smaller. 

The correlation between length and breadth was in both cases 
high, but as the correlation was certainly not linear we have 
not computed the correlation coefficient. In general the corre¬ 
lation coefficient does not seem, to have a great importance for 
biological research, except if its absolute value is very high 
(above 0,9) or very low (under 0,1). For the whole theory of 
correlations is based on the same suppositions as the theory of the 
normal frequency curve, and. has the same theoretical value 
as this theory. Moreover it is impossible to gather from the 
correlation coefficient if there are any systematical deviations. 
One has a whole set of valuable observations and one exchanges 
it for a number of doubtful value. 

The frequency curve of the length of the paired leaflets was 

4 ) The records of our measurements are kept in the Botanical Laboratory. 
They are available for all investigators who are interested in them. (Please 
apply to Botanical Laboratory Government University, Leyden, Holland.) 
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asymmetrical. The frequency curve of the breadth of the leaflets 
was also slightly asymmetric and moreover leptokurtic (fig. 2). 



We have also traced the frequency curve of the difference 
in length and of the difference in breadth of two opposite leaflets. 
If anywhere then here it may be expected that a Gaussian 
curve should appear. For the difference in length and in breadth 
of two opposite leaflets is very small. In more than half the 
number of cases the differences in length were less than 1.5 mm. 
and in more than three fourth of the cases the differences in 
breadth were less then 1.5 mm. As the differences in the total 
length were very small it may be expected that the differences 
in growth rate were also very small, the more so as the exter¬ 
nal conditions of two ..partners” cannot have differed much. 
The difference in growth rate of the two leaflets would have 
been almost as often positive as negative if the constitution 
was always equal, for the external conditions will hardly ever 
favour always the same leaflet. In that case (equality of the 
constitution) the final difference in length would be the sum 
of a great number of independent quantities which were as 
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probable positive as negative. The frequency curve of the dif¬ 
ferences in length of two opposite leaves would thus have been 
a Gaussian curve. But as is shown in fig. 3 the frequency curve- 



Fig. 3.' z-function for the differences in length .... and for the differences 
■' in breadth x x x of two opposite foliola of Robinia. The line is 
the z-function of the Gaussian curve with the same mean and a 
as the curve of the differences in breadth. Abscissa difference in 
length in mm (lower side) and of the differences in breadth (upper 
side) in mm. Ordinate z. 


is clearly leptokurtic. Therefore the constitution has evidently 
influenced the growth rate, and the suppositions on which are 
based the derivation that the frequency curve must be a Gaussian 
curve were not satisfied. . t . 

The frequency curves of the differences in length and of the 
differences in breadth are equal if the unities of length are suitably 
chosen. This means probably that there is an internal factor 
which determines the form of the leaflet and that if the con¬ 
stitution is unfavourable for the growth in length it is gene¬ 
rally as unfavourable for the growth in breadth. 

! I The’. general conclusions ■ on the measurements of the leaves 
of a Robinia Pseudacacia shrub are therefore that the frequency 
curve is: in no case a > Gaussian curve and that there are strong 
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indications that this is a consequence of the influence of the con¬ 
stitution on the growth rate of the foliola. 


§2. The results of the experiments with Kalanchoe verti - 
cillata. 

On October 24, 3 weeks after the sprouts were planted the 
breadth of the largest ^cotyledon” was measured. We call coty¬ 
ledons” the two leaves which were formed while the sprout was 
still attached to the mother plant. The breadth of this largest 
„cotyledon” is a measure for the quantity of reserve food the 
sprout has stored. But as the sprouts were planted three weeks 
before the measurements, the breadth of the cotyledons” is also 
a measure for the velocity with which the roots were formed. 
For it is clear that all parts of a sprout will shrink as long as 
the quantity of water and foot it loses by respiration and 
transpiration is not compensated by the quantity of water it 
absorbs from the soil. And only if the sprout has formed roots 
it will be able to absorb any important quantity of water from 
the soil. If the cotyledons” are wide it may therefore be expected 
that the constitution of the plant is good at that moment. 
For probably it has formd its roots early, and the quantity of 
reserve food it contained when planted was not small. In our 
experiments it was visible during the whole life of the plants 
that plants whose cotyledons” were large were generally longer 
than plants whose cotyledons” were small. 

The frequency curve of the breadth of the largest cotyledon” 
was asymmetric to the right and, moreover bradykurtic (fig. 4). 
As the cotyledons” have grown during two different periods 
(on the mother plant and on the soil) which periods were separated 
by a period of shrinking (on the soil before the roots were formed) 
we have here a very complicated process. Therefore those 
who mean that in general the frequency curve must be Gaussian 
will perhaps not accept this frequency curve as a refutation 
of their theories. 

On the tenth of December the length of both leaves of the 
third pair was measured. Those are the first leaves which are 
situated just above the cotyledons”. (Kalanchoe verticillata is 
decussate). The mean leaf length of the leaves above the 
largest cotyledon” (which will be called Di) was 0.31 mm. 
less than the mean leaf length of the other leaves (which will 
be called D 2 ). As the frequency curve of the length of D 2 is 
situated (practically) entirely to the right of the frequency curve 
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Fig. 4. z-function of the width of the largest ’cotyledon*. Abscissa width 
in mm. Ordinate z. 

of Di this difference is real (fig. 6). Moreover in only 101 
cases out of 531 the difference D2—Di was negative. The frequency 
curve of D 2 —Di was very curious (fig. 5). 



Fig. 5. Frequency curve of D 2 —Di. Abscissa D 2 —t) r in mm. Ordinate 
number of plants. 
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It had two maxima, one at o mm. and one at 0.8 mm. We 
have not found any explanation of this phenomenon. But it 
illustrates very clearly that a frequency curve of homogeneous 
material may be bimodal. 

Neither the frequency curve of Di nor of D2 was normal. 
They were both skew and bradykurtic (fig. 6). 



Fig. 6 . z-function of D x o—o—o. z-function of D 2 -Abscissa length 

of leaves in mm. Ordinate z. 

1 

It is clear that those three frequency curves (of D 1} D 2 and 
D 2 —Di) disprove absolutely the theory that the normal fre¬ 
quency curve must be Gaussian. 

Nine times we have measured, in the course of our experiments, 
the length of the plants. This happened to yield a sufficient 
material to prove the validity of our objections against the 
theories of Quetelet and of K a p t e y n. The form of the 
frequency curve of the length of the plants was never Gaussian. 
The first measurement B gave an irregular frequency curve, all 
others however were distinctly bradykurtic. This should be a 
sufficient refutation of the theory of Quetelet (fig. 7). 

In our discussion on the frequency curve of the breadth of 
the largest „cotyledon” we have made plausible that plants with 
a different breadth of the „cotyledon” have a different constitu- 
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Fig. 7. z-functions for the length of Kalanchoe (all plants) after different 
periods, 

E. measurements of Jan. 11. O. measurements of April 30. 

G. measurements of Febr. 6 . Q. measurements of May 15. 

K. measurements of March 11. S. measurements of June n. 

M. measurements of March 27. T. measurements of July 4. 

Abscissa length in mm. Ordinate z. 


tion. If one chooses plants with a nearly equal breadth of the 
„cotyledon” (e.g. between 6 .6 j. —■ 7.10 mm.) and therefore with 
an equal constitution on October 24, which will be in that case 
the form of the frequency curve of the length of those plants? 
From fig. 8 it is clear, that even in this case the frequency 
curve is not Gaussian. 

We may conclude: The frequency curve of the length of a 
number of plants of Kalanchoe verticillata, which were (1) all 
of equal heredity, (2) of an equal constitution when planted, and 
(3) were grown under practically equal external conditions, is 
not Gaussian. Therefore the theory of Quetelet c.s. is unaccep¬ 
table in this case. And as we have found no examples, where 
a frequency curve is Gaussian, if all individuals satisfy the con¬ 
ditions A, the idea has to be rejected on other grounds as well. 

A second method to disprove the theories of Quetelet c.s. 
and of K a p t e y n is to demonstrate that the suppositions on 



Fig. 8. z-functions of length of plants which width of the largest ’coty¬ 
ledon* on October 24 was from 6.20 mm. to 6.6 5 mm. Abscissa length 
in mm. Ordinate z. B measurements of 16 Nov., for E—Q vide 
fig. 7. (107 Individuals). 

which they are based are not realized in nature. The theory of 
the adherents of Quetelet is based upon the supposition that 
the increase in length during a certain oeriod does not depend 
upon the increase in length during any previous period. There¬ 
fore it must be independent of the length of the plants. That 
is to say the mean growth rate during a certain period of 
plants, which had a different length at the beginning of the 
period, must be equal. Moreover, this mean growth rate must 
be the same for different periods. 

In the theory of Kapteyn it is supposed that the mean 
growth rate during a certain period, is a function of the length 
of the plants. Further it is supposed, that this mean growth rate 
of plants is equal in different periods. A corollary of the first 
supposition of Kapteyn is, that the variability of the growth 
rate of plants of the same length is the same function of the 
length as the mean growth rate. In fig. 9 and fig. 10 we have 
plotted, for the different periods between our measurements, the 
mean growth rate in /j. per diem against the length of the 
plants. Two things are evident from this figure: 

1. The mean growth rate is not constant. Therefore the 
theory of the adherents of Quetelet does not seem to apply. 
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Fig. 9. Mean growth rate as a function of the length for different periods. 

Abscissa length of the plants. Ordinate mean growth rate in fi 

per diem. 

B period between Nov. 1 6 and Jan. 11. 

E „ „ jan. 11 and Feb. 6. 

G „ „ Feb. 6 and March. 11. 

K „ „ March 11 and March 27. 

M „ „ March 27 and April 30. 

2. The mean growth rate is not only a function of the 
length of the plants, but it depends also upon their age. 
Therefore also the theory of Kapteyn does not hold here. 

Moreover the theory of Kapteyn does not suffice three 
other sets of data: 

1. According to Kapteyn the variability in the growth rate 
of plants of equal length is proportional to the mean growth rate. 
From table 1 (p. 119) (which gives the mean growth rate during 
the period G of plants of different length and the variability of 
the growth rate) it is evident, that variability and growth rate 
are not proportional. The variability is practically constant, the 
growth rate increases with the length of the plants. Our other 
measurements confirm this result. This should be a sufficient 
proof that the theory of Kapteyn is inapplicable. 
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2. Kapteyn has given a method to determine, from the 
form of the frequency curve, the mean growth rate of plants 
of a certain length. This mean growth rate is proportional to the 
cotangent of the line-tangent to the z-curve. If one has plotted 
the z-curve of the same material for two different moments, 
the calculated mean growth rate of plants of a certain length 
must be the same for both curves. (The mean growth rate does 
not depend on the time, according to K a p t e y n.). Therefore 
the slope of both z-curves should be the same for an equal 
abscissa. From fig. 7 and fig. 8 it is clear that this is not true. 

3. According to Kapteyn the constitution of a plant is 
unambiguously determined by its length. The probability that 



Fig. 10. Mean growthrate as a function of the length. Abscissa length of 
the plants. Ordinate mean growth rate in per diem. 

M period between March 27 and April 30. 

O „ » April 3c and May 15. 

Q „ ,, May 15 and June 11. 

S „ ,» June 11 and July 4. 
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Table 1. 


Length of pknts 
on Febr. 6. 

Mean growth ! 
rate Febr. 6.- 
March 11 . in 
ju per diem. 

Standard 
Deviation of 
mean growth 
rate 

Number of 
pknts. 

2 — 4 

25.4 


17 

4 — 6 

44.9 


39 

6 - 8 

59.8 

13 

69 

8—10 

78.2 

21 

88 

10—12 

93.2 

19 

103 

12—14 

100.4 

21 

104 

14—16 

111.4 

20 

61 

16—18 

111.8 

19 

41 

18—20 

122.7 


20 

20—22 

121.2 


2 


the growth rate of a plant is less than the growth rate of 
other plants of the same length is therefore always 3 ^ 2 • If the 
plants are classified according to their length after each measure¬ 
ment, the probability that the difference in rank of the same 
plant at two successive measurements is positive (or negative) 
is therefore If there are 8 measurements, the probability that 
there will be a positive (and (7-a.) negative) changes in rank, 
will be: 

7! 1 

a! (7-a)! * 2 7 

In table 2 is given the number of plants with (a) negative 
changes in rank as we have found in our experiments, and (in 
the third column) the theoretical number of negative changes 


Table II. 


Number of 
negative 
differences 
in rank. 
a 

Number of 
pknts, 
as found. 

Number of 
pknts, 

as calculated. 

0 

9 

3.5 

1 

43 

24.5 

2 

84 

. 73.5 

3 

90 

122.5 

4 

89 

122.5 

5 

88 

73.5 

6 

36 

24.5 

7 

10 

3.5 
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in rank. It is clear that there is a systematic difference between 
the theoretical and the experimental numbers. 3 and 4 negative 
deviations are too rare (% the theoretical number) all other 
occur too frequently. The explanation of this difference is 
probably that if, during a certain period a plant diminished 
in rank, the probability that during the following period it 
will again diminish in rank is more than Yz. This means that 
the constitution of a plant is not unambiguously determined 
by its length. Therefore this result proves that in our case at 
least the theory of Kapteyn is inapplicable. 

3. A third method to show the inapplicability of Q u e t e 1 e t’s 
theory is the following: 

We have classified the plants according to the breadth of their 
largest ,,cotyledon” on October 24. The frequency curves of the 
length of the plants on July 4 plotted for those different classes, 
showed to be distinctly unequal (fig. 11). This proves that the 
breadth of the „cotyledon” may be taken as a measure of the 
1.6 
I* 
i.2 
>0 
08 
0.6 
to 
02 
0 

•02 
-Oh 
-66 

•19 
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Fig. 11. z-function for the length of the plants on July 4, classified accor¬ 
ding to the width of the largest ’cotyledon on October 24. 

T. All plants (53 6 plants). 

I. Width of ’cotyledon* < 5.40 mm. (73 plants). 

II. „ „ „ 5.40 mm. — 6.20 mm. (105 plants). 

HI. „ „ „ 6.20 m.m. — 6.6s mm. (108 plants). 

IV. „ „ „ 6.65 mm. — 7.10 mm. (102 plants). 

V. „ „ „ 7.10 mm. — 7.60 mm. (83 plants). 

VI. „ „ „ > 7.60 m.m. (65 plants). 
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constitution of the plants on October 24, and that, if the diffe¬ 
rence in constitution was ivery large on that date, this difference 
was not annihilated after nine months. 

From this result it is clear that the theory of Quetelet which 
does not take into account the constitution of the organism, 
is unacceptable. 

The results of our countings of the number of leaves corro¬ 
borate the results of our measurements. As is clear from fig. 12 
a Gaussian curve was not obtained. 



Fig, 12. z-function for number of leaves. Abscissa number of leaves. Ordinate z. 
For F, H, L, N vide p. 106. 


SUMMARY AND CONCLUSIONS. 

1 . Theoretical Introduction. 

X. In § 1—8 of the introduction we 'have determinated which 
were the necessary conditions (p. 89 conditions A.) which must 
be satisfied by a group of plants that the frequency curve of any 
characteristic of those plants may have any practical value. If 
those conditions are satisfied it is possible to obtain the same fre¬ 
quency curve if the experiment is repeated. To simplify the dis- 
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cussion we have only considered the frequency curve of the length 
of the plants. 

2. In § io—13 we have discussed the theory that the frequen¬ 
cy curve should be identical with the error curve of Gauss. This 
theory was partly based upon experimental facts (e.g. by Que- 
t e 1 e t and B o r e 1 ). Later investigations have shown, however, 
that if the conditions A are satisfied the frequency curve is rarely, 
if ever, Gaussian. 

Other investigators suppose that the frequency distribution 
originates in the same manner as the error curve does. They divide 
the whole period of growth in a great number of elementary 
periods. If it is supposed that the increase in length of a certain 
plant during an elementary period is as likely more as less than 
the mean increase in length of all plants during that period, the 
frequency curve of the total length of the plants will be Gaus¬ 
sian. This supposition is equivalent with the hypothesis that the 
growth rate of a plant is only determined by the external factors, 
and not by the previous history of the plants. But this is cer¬ 
tainly not true. Therefore this proof of the theory that the form 
of the frequency curve is identical with the form of the error 
curve is unacceptable. 

3. In § 14 the theory of Kapteyn is discussed. Though a 
great improvement on the theory of Quetelet, the theory of 
Kapteyn was nevertheless found not to apply. The fundamental 
supposition that the mean growth of plants of the same length 
but of different age is equal had to be rejected. 

4. In § 13 we discussed the drawbacks of Pearson’s method of 

curve fitting. 1 

5. In § ij it was demonstrated that the method of Fisher 
and ’Student* is generally not applicable to biological pro¬ 
blems. For to apply this method it is necessary to know ’a priori’ 
the frequency distribution of the ’Universe’. 

6 . In § 16 we have discussed a practical method to compare 
frequency curves (the method of Kapteyn). 

II. The experiments . 

1. The length and breadth of the foliola of all fullgrown leaves 
of a shrub of Robinia Pseudacacia L. were measured. Neither the 
frequency curves of the length and of the breadth of the foliola 
nor the frequency curves of the differences in length and in 
breadth of two opposite foliola was Gaussian. As the foliola 
satisfy probably the conditions A this proves that the theory of 
Quetelet does not hold. 
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2. jj 2 plants of Kalanchoe verticillata S. Elliott orginating 
from the same P2 were grown under practically equal external 
conditions. The length of the plants was measured after nine 
consecutive periods. The mean growth rate during the eight sub¬ 
sequent periods depended both on the length as well as on the 
age of the plants, so both the theory of Quetelet and of 
K a p t e y n did not apply to this case. 

The frequency curves of the length were never Gaussian. 

3. Three weeks after the sprouts were planted the breadth of 
their largest ’cotyledon’ was measured. This breadth happened to 
be a good measure of the constitution (the internal condition as 
far as determined, by age and previous history) of the plants at 
that moment. For if the plants were classified according to the 
breadth the frequency curves of the length was always different 
for the different classes. 

We may conclude from our theoretical and experimental inves¬ 
tigation that it is impossible to determine ’a priori’ the form of 
the frequency curve of any characteristic of a group of plants. 

The foregoing investigations were carried out in the Botanical 
Laboratory of the Government University of Leyden. I am glad 
to be able to express here my sincere thanks to Professor Dr. L. 
G. M. Baas Becking for his constant help, suggestions and for his 
criticisms. I also want to thank Dr. C. D. Verrijp for his valuable 
advice and his help with the designing of the experiments, and 
Dr. J. B. D. Derksen for his advice in the mathematical part. 


APPEND. 


Derivation of formulae 

Van U v e n supposes in his derivation of the formula of 
Kapteyn that the increase in length of a plant of length x 
during an elementary period is m^{x). m indicates here the 
influence of the external conditions during this elementary period. 
>P (x) is the mean growth of plants of the length x. fi is a constant 
of such a magnitude that both m and f (x) may assume finite values. 

The magnitude of m during an elementary period is assumed to 
be independent of its magnitude during any former period. As 
we have said this will hardly ever happen in nature. We have 
assumed therefore (p. 99) that m may assume the values o to 2k. 
If m has, during a certain period, the magnitude a the probability 
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that during the succeeding period its magnitude will be a—i is 
and the probability that it will be a+i is — 2k a . 


Firstly it will be demonstrated that if, during a certain period, 
the frequency distribution is (symmetrically) binomial it will 
always remain (symmetrically) binomial. 

If the frequency distribution is binomial the probability that 


during a certain period m has the magnitude a+i is ~ ( a Ti) 
that m has the magnitude a is ^ (a) ’ anc ^ m has the 

magnitude a—i is ~ (a—l) * *^ ie P r °h a hility that during the 


following period m has the magnitude a is: 
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Therefore the probability that m has (during this period) the 
magnitude a is equal to the probability that m had the magnitude 
a during the previous period. 

In his derivation of the formula of Kapteyn Van U v e n has 
to calculate the sum of the quantities m over the whole period of 
growth. As he supposes that the quantities m are (during each 
elementary period) distributed according to the normal law, and 
that moreover the magnitude of m during an elementary period is 
independent of its magnitude during a previous period the sum 
of m (summed over the whole period of growth) will be distri¬ 
buted according the normal law as well. It will now be demons¬ 
trated that if the quantities m are dependent of one another as 
stated above their sum is still distributed according to the normal 
law. To do this we shall make use of a theorem of S. Bernstein 
(Bernstein 1927 p. 24): 
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’Soit S n = xi + x 2 + .... + x„ une somme de quantites depen- 
’dantes entre elles jouissant des proprietes suivantes: 

T. M (So) = Bo > Mn*. ohx> ~ 

’2. Quelles que soient les valeurs deja connues de certaines des 
’xk on peut fixer un nom’bre L, tel que, pour i > k, l’esperance 
’mathematique de |x?j reste inferieure h L. 

’3. Dans les memes conditions, on peut fixer un nombre N 
’tel que Ton ait: M (xi+i + Xi+2 + .... + Xi+ g ) 2 < N B n 
’quel que soit g. 

>1 

*4. Dans les memes conditions, pour i—k > nP , ou p> ~^~ est un 
’nombre fixe, la variation de l’esperance mathematique de Xj 
’ne depasse pas — ou^> 1——, et lorsqu’on a de plus j—k> n P 

’la variation de l’esperance mathematique du produit Xj Xj 
’ne depasse pas 

’Dans ces conditions la probability de l’inegalite 

z 0 1/2B7 < S„ < Zj |/2B7 

“"a pour limite 

i r z < 

l/V J e-z’dz 

z 

M (x) means the mathematical expectation of x. As the quan¬ 
tities x are supposed to have ’a priori’ the probable value o we 
will consider the summation of m—k which has also the probable 
value o ’a priori’. We will prove that the quantities m—k satisfy 
the four conditions of Bernstein. 

i- M (S 2 ) = M (x? + xf +-+ 2 xi x 2 + ..) 

= M (x 2 ) + M (x 2 ) + .... -T 2 M (xi X2) + • • 

The expectation of x? is always a positive quantity. The 
expectation of x; Xj is certainly not negative for the probability 
that Xi and xj have the same sign is greater than the probability 
that they have a different sign (vide 4). Therefore M (Sn) is the 
sum of n positive quantities and y 2 n (n—1) quantities which are 
certainly not negative. Therefore 

M (Sn) - > Mn 1 wh.:re X = 1 and 
where M is the smallest of the quantities M (x?). 



126 


The quantities x=m—k satisfy, therefore, the first condition 
of Bernstein. 

2. The expectation of |x 3 | is always finite as xi can only 
assume finite values. 

3 - M + x *+2 + - + x i + g ) 2 ^ 

M (x a + x 2 + . + xn ) 2 \ n / 

If g is finite it is of course always possible to chose N so that 
this inegality is satisfied. If g tends to oo one has, in the nume¬ 
rator and in the denominator, functions of the same form. 
It will be shown in 4. that the expectation of Xi +C tends to- 

(a—k) (“Y") C whatever the value of Xi is, when c tends to 00 . There¬ 
fore the expectation of xi+k Xj tends to zero if j—i tends to 00 
and k remains finite. If k tends also to 00 this product tends- 
to the same value whatever the value of x is. The expectation 
of all except a finite number of products xi Xj tends therefore to 
the same value in numerator and denominator. The expectation of 
x? of the numerator is less than k 2 . Therefore it is possible to 
choose for N such a value that condition 3 is satisfied. 

4. To calculate the mathematical expectation of x t if the value 
of Xk is known, it is easiest to replace our model by another, 
equivalent, model. 

Suppose there are 2k numbered balls, part of which are in a 
vase. The number of balls in the vase is equal to the magnitude 
of m during a certain period. There is moreover a second vase 
with 2k tickets numbered from 1 to 2k . To determine the magni¬ 
tude of m at a certain moment if at the previous moment it was rri 
a ticket is drawn. After the number is read the ticket is replaced 
in its vase. If the ball with the same number was in the vase 
it is now taken out. The number of balls in the vase, and there¬ 
fore the magnitude of mm , is in this case mi —1. The probability 

that this happens is If the ball with the same number as the 

ticket was outside the vase it is now placed in it. The number of 
balls and therefore the magnitude of mm is in this case mi +1. 

ID 

The probability that the above happens is — — 1 . The proba¬ 
bility that at a certain moment there are a balls in the vase is 
therefore equal to the probability that m has the magnitude a 
on that moment. 

The probability that a certain ball which was in the vase will 
be again in the vase after 2m tickets are drawn is the probability 
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that its ticket is drawn an even number of times: 

1 2m(m—1) /l\2m—2 /2k-l\2 , 

(2k)2"> 1.2 \2k/ \ 2k"/ " t ’*'*'* 

The probability that a ball which was in the vase will not be 
in the vase is the sum of the probabilities that its ticket is drawn 
an odd number of times: 

2m/ 1 \2m—1 / 2k—1 \ 2m (2m—1) (2m—2) / 1 \2m-3 / 2k- 1 X3^ 

1 \2kj \ 2k /"*’ 1.2.3. \2k/ \ 2k / . 

The differense of the probabilities that the ball is in the vase 
and that it is not in the vase is: 


/ 1 \2m 2m/ 1 \2m—1 / 2k— 1 \ 

\2k/ — 1 V2k/ \ 2k /+ * 

/ 1 2k— l\2m_ /2k—2 \ 2m_ / 1 \2m 

\2k~^) ~\ 2\<r) ~v~y) 


If m tends to oo the difference of those probabilities tends to 
zero. This means that, if a certain ball was originally in the vase, 
it is after a sufficient long time nearly as probable that it should 
be inside as outside. 

If 2m + i tickets are drawn the probability of this difference Is 

l\2m +1 

k / 

If a certain ball is originally not in the vase one may prove 
in the same manner that the* probability that it is in the vase 
after a sufficiently long time is (practically) as great as the 
probability that it is not in the vase. 

The mathematical expectation of the number of balls in the 
vase after m periods, if originally there were a balls in the vase is 



=i8+iaKr + k _ k(l _x)- a+ i aKr 

= k + (a-k) 

As the mathematical expectation of m is k the variation of the 
mathematical expectation is therefore (a—k) ^1— Therefore if 
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m > nP this variation is certainly less than — if n tends to oo # 

The variation of M (x t xj ), if the value of xt is known, will 
be of the same order of magnitude, for the value of x depends 
(directly) only upon the value of Xi (if j > i). Therefore the 
condition 4 of Bernstein is also satisfied. 

This means that the sum of the quantities m which depend on 
another in the manner stated, satisfies the Gaussian law also in 
this case, and that, therefore, the result of Van Uiven may be 
retained. 
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